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ABSTRACT 

We review methods for measuring the sizes, line widths, and luminosities of giant molecular clouds 
(GMCs) in molecular-line data cubes with low resolution and sensitivity. We find that moment 
methods are robust and sensitive — making full use of both position and intensity information — and 
we recommend a standard method to measure the position angle, major and minor axis sizes, line 
width, and luminosity using moment methods. Without corrections for the effects of beam convolution 
and sensitivity to GMC properties, the resulting properties may be severely biased. This is particularly 
true for extragalactic observations, where resolution and sensitivity effects often bias measured values 
by 40% or more. We correct for finite spatial and spectral resolutions with a simple deconvolution and 
we correct for sensitivity biases by extrapolating properties of a GMC to those we would expect to 
measure with perfect sensitivity (i.e. the K isosurface). The resulting method recovers the properties 
of a GMC to within 10% over a large range of resolutions and sensitivities, provided the clouds are 
marginally resolved with a peak signal-to-noise ratio greater than 10. We note that interferometers 
systematically underestimate cloud properties, particularly the flux from a cloud. The degree of bias 
depends on the sensitivity of the observations and the (u, v) coverage of the observations. In the 
Appendix to the paper we present a conservative, new decomposition algorithm for identifying GMCs 
in molecular- line observations. This algorithm treats the data in physical rather than observational 
units (i.e. parsecs rather than beams or arcseconds), does not produce spurious clouds in the presence 
of noise, and is sensitive to a range of morphologies. As a result, the output of this decomposition 
should be directly comparable among disparate data sets. 
Subject headings: ISM:clouds — methods:data analysis — radio lines:ISM 



1. INTRODUCTION 

Over the last 15 years, it has become possible to ob- 
serve molecular emission in nearby galaxies with suffi- 
cient resolution and sensitivity to distinguish individual 
giant molecular clouds (GMCs). The immediate goal 
of such studies is to determine whether (and how) the 
GMCs in other galaxies differ from those seen in the 
Solar neighborhood. The most common method used 
to address this question has been to use molecular-line 
tracers of H2, in particular 12 CO(l — ► 0), to compare 
the macroscopic properties (size, line width, and lumi- 
nosity) of GMCs in other galaxies to those of Milky Way 
GMCs. Unfortunately, a wide variety of methods have 
been used to reduce data from spectral line data cubes 
into macroscopic GMC properties. As a result, many 
of the differences between GMC populations found in 
the literature can be attributed, at least partially, to ob- 
servational artifacts or methodological differences. It is 
therefore difficult to assess what the real differences be- 
tween GMC populations are based on the reported data 
in the literature. 

For GMCs that are either marginally resolved or 
marginally detected, observational biases can be severe. 
Figure ^ shows the variation of the measured spatial size 
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and line width with the resolution for a model cloud. 
Typical Galactic GMCs have sizes of a few 10s of par- 
secs, comparable to the spatial resolution o f many data 
sets u s ed to study extragalactic G MCs (e.g. iVoeel et alJ 
Il987t IWilspn fc Scovillel 1199(1 iWilson Reidl I199H 
l lsrael et al.lll993HWilson fc Rudolph! 119931: iFukui et alJ 
119991 iSheth et alJl2000l iRosolowskv et al.ll2003|) . Fig- 
ure ^ shows that when the size of the beam is com- 
parable to the size of the object, the measured size is 
much higher than the true size of the object. Millimeter 
spectrometers and correlators often have excellent fre- 
quency resolution, so the spectral resolution bias is usu- 
ally less important for GMC studies, but it can become 
substantial when data are binned to increase signal-to- 
noise. Figure shows that the measured spatial size, 
line width, and flux of a re al GMC in M 33 (EPRB1 
from IRosolowskv et al.ll2b03j) are all strong functions of 
the sensitivity of the data. We discuss another major 
source of bias, the method by which emission is decom- 
posed into GMCs, in the Appendix. 

To place these biases in the the context of real molec- 
ular cloud studies, Figure shows the peak flux and 
angu lar size of typical GMCs (those of iSolomon et alJ 
Il987|) as a function of distance. The sensitivities and 
resolutions of a representative sample of molecular cloud 
studies have been indicated as horizontal lines in these 
plots. Distances to commonly observed objects have also 
been labeled. Figure |21 demonstrates that most studies 
of extragalactic GMCs are conducted where the clouds 
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Fig. 1. — The resolution bias for a model cloud. These two 
panels show the variations in the measured spatial size (left) and 
line width (right) as a function of the resolution of a data set. 
These plots show measurements of Gaussian clouds convolved with 
a Gaussian beam and integrated across square velocity channels. 
Marginally resolved data yields measurements affected by a signifi- 
cant resolution bias. The spatial size bias is encountered frequently 
in extragalactic GMC measurements. 



This section describes how to derive the spatial size, 
line width, and flux from a region of emission within a 
spectral line map (a "data cube") while accounting for 
the finite sensitivity and re solution of the data set. We 
use moment methods (e.g. iSolomon et al.lll987j) . which 
make full use of position and intensity information with- 
out assuming a functional form for the cloud. They are 
therefore robust to pathologies in the data within the 
cloud. 

Moments are, however, sensitive to the inclusion of 
false emission (noise) at the edge of a cloud. Including 
noise has the effect of artificially increasing the values of 
the moment. Therefore the methods outlined here should 
be employed in conjunction with careful signal identifi- 
cation so that the calculations include as little noise as 
possible. We assume throughout this section that the al- 
gorithm is being applied to a distribution of real emission 
that we label a "cloud" (we discuss signal identification 
and decomposition in the Appendix). 



of interest are only marginally resolved and are found at 
low sensitivity. Even future observations of GMCs in the 
Virgo cluster using the Atacama Large Millimeter Array 
(ALMA) will be affected by resolution and sensitivity 
biases. 

In this paper, we examine the effects of biases stem- 
ming from finite spatial resolution, spectral resolution, 
and sensitivity in molecular- line observations of GMCs. 
We recommend data analysis methods to produce a stan- 
dardized set of observed cloud properties that account for 
these biases. Most of the methods used in this paper have 
been adopted piecemeal and ad hoc in previous studies. 
Here, we endeavor to justify our choice of methodolo- 
gies and to synthesize various author's techniques for 
approaching the problems of molecular cloud data anal- 
ysis. In Section we describe a standardized method to 
measure three basic properties of an emission distribu- 
tion — size, line width, and flux — while accounting for 
the sensitivity and resolution of a data set. In Section|Sl 
we discuss how these measurements can be transformed 
into physical quantities — radius, line width, luminos- 
ity, and implied mass. Finally, we consider the effects of 
using interferometers to derive cloud properties in Sec- 
tion The results in these three sections are applicable 
to all observations of molecular clouds. In contrast, the 
methods used for decomposing emission into molecular 
clouds vary widely, and there is little basis for favoring 
one method over another in all cases. Hence, we defer a 
presentation of our decomposition algorithm to the Ap- 
pendix of the paper, leaving only a brief discussion of 
the decomposition problem in Section |SJ We conclude 
the paper by exhibiting several examples of the applica- 
tion of our standardized methods to previously observed 
data and making recommendations for future observa- 
tions (Section |BJ). 

The methods described in this paper require a com- 
puter program to apply. A documented software ver- 
sion of the decomposition and measurement algorithms 
is available from the authors as an IDL package. 

2. MEASURING MOLECULAR CLOUD PROPERTIES 



2.1. Moment Measurements of Size, Line Width, and 

Flux 

This subsection describes how to apply moment meth- 
ods to derive the size, line width, and flux from a distri- 
bution of emission (a "cloud" ) within a position-position- 
velocity data cube. The data cube consists of a number 
of pixels that have sizes of 6x, Sy, and 5v in the two spa- 
tial dimensions and the velocity dimension, respectively. 
The ith pixel in the data cube has positions Xi and iji, 
velocity Vi, and brightness temperature Ti. We assume 
that the cloud is contiguous and bordered by an isosur- 
face in brightness temperature of value T ec i ge , so that all 
of the pixels in the cloud have T > T e d ge and the pixels 
outside the cloud have T < T e d ge or are separated from 
the cloud by emission with T < T e d ge - 

We begin by rotating the spatial axes so that the x and 
y axes align with the major and minor axis of the cloud, 
respectively. We determine the orientation of the major 
axis using principal component analysis. We find the 
eigenvectors of the intensity-weighted covariance matrix 
for the cloud, 
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In the equations above the sum runs over all pixels 
within the cloud and x and y are the intensity weighted 
mean positions within the cloud (defined below). We 
define the new x axis to lie along the eigenvector with 
the largest eigenvalue — the major axis of the cloud. The 
y axis lies perpendicular to the x axis along the minor 
axis of the cloud. In the discussion below, x refers to 
position along the major axis and y refers to position 
along the minor axis. Rotating the axes in this manner 
yields information about the axis ratio of the cloud and 
allows a more careful deconvolution. This method for 
determining the posi tion angle o f mole cular clouds has 
also been adopted bv lKoda et al.l l)2005|) . 

To measure of the size of the cloud, we compute the 
geometric mean of the second spatial moments along the 
major and minor axis. This is a r , the root-mean-squared 
(RMS) spatial size: 
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Inverse Sensitivity [T^/T^J Inverse Sensitivity [T dIt /T„,] 

Fig. 2. — The sensitivity bias for a bright GMC in M 33 (EPRB1 from Rosolowskv ct al. 2003). These four panels show the variations 
in the measured spatial size, line width, and flux of the GMC for a range of sensitivities (signal-to-noise ratios). We simulate the effect of 
changing sensitivity by varying the boundary isosurface, T e ^ ge , and measuring properties only from data within this isosurface. All four 
properties are a strong function of T e< j ge , so the data display a substantial sensitivity bias. The gray, dashed lines show the extrapolation 
to the Kelvin isosurface using a weighted, least-squares fit to each property as a function of T et j ge . The extrapolated value is shown as 
an X. 
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where a ma] {T ed ge) and a mm (T edge ) are the RMS sizes 
(second moments) of the intensity distribution along the 
two spatial dimensions. We adopt this particular func- 
tional form since it has been used i n previous obser- 
vational studies d Solomon et alj| 1987l) and explored in 
depth by Bcrtoldi fc McKea l)1992fl with respect to in- 
clination, aspect ratio, and virialization. We calculate 

®maj(T e( Lge) and O 'min(T- ^edge) by. 
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, where (5) 
(6) 



In the equations above the sum °" runs over all pix- 
els within the cloud. We have written each of the mo- 
ments as a function of T edge because changing the isosur- 
face that defines the boundary of the cloud (T edge ) will 
change the set of pixels included in the sum and therefore 
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Fig. 3. — Peak flux density (left) and angular size (right) of molecular clouds as a function of distance. Diagonal lines indicate the 
peak fl ux density and size of clouds with 10 3 ,10 4 ,10 5 and 10 6 Mq (bottom to top) from the relationships observed in the inner Milky 
Way I Solomon ct al. 1987). For reference, the diameters of these clouds are roughly 3, 10, 30, and 100 pc respectively. Horizontal black 
lines indicate the typical 5<r sensitivity (left) and ef fective resolu t ion (r ight) of various telescopes used to observe th e cloud. Observationa l 
parameters from the following studies: CfA 1.2 m: Dame ct al. (2001), Massachusetts-Stony B rook (MSB) survey : Sanders ct al. (1986), 
FCRAO Outer Galaxy S urvey: IHever et all ft99Sh . NANTEN studies of the Magellanic Clouds: IFukui et ail 119991) . BIMA studies of M33: 
Rosolowskv ct al. (2003), ALMA: web-based sensitivity calculator. 



the values of the moments. Note that ay is not the RMS 
distance (d — ^/ x 1 + y 2 ) from the center of the cloud. 
Rather it is the analogous to the RMS size of the cloud 
along an arbitrarily chosen axis. Thus, ay = a x = o~ y for 
a perfectly round cloud, while the RMS distance from the 
center for such a distribution is larger, ad = V2a x > a x . 
Also note that a ma j / a m i n is the axis ratio of the cloud 
and will be ~ 1 for round clouds and ^ 1 for elongated 
or filamentary clouds. 

We calculate the velocity dispersion, a v (T e d ge ) in the 
same manner as the size: 
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The sums again run over all emission in the cloud. 
For a Gaussian line profile, such as that found for 
most clouds, the full-width half-maximum (FWHM) line 
width, AV(T e d ge ) will be related to a v (T e d ge ) by 



AV{T edg e) = V81n(2) a v (T edge ). 



(9) 



Finally, we calculate the flux of the cloud, Fco{T e d ge 
using the zeroth moment: 



Fco(T e dge) =z2 Ti ^ V Sx 5v ' 



(10) 



If Sx and 5y are in units of arcseconds, Sv in km s , and 
Ti in K, then the resulting flux will have units of K km 
s _1 arcsecond 2 . 



2.2. Correcting for the Sensitivity Bias 

The sensitivity of a dataset influences the cloud prop- 
erties derived from that data, a fact that we have em- 
phasized in the previous section by explicitly writing the 
moments as functions of T e d ge , the cloud boundary (usu- 
ally set by the signal-to-noise ratio of the data). Figure 
12 shows the variation of spatial size, line width, and flux 
as a function of sensitivity for a bright cloud in M33. 
The data for this cloud shows a substantial sensitivity 
bias; all of the derived properties are strong functions 
of the boundary isosurface (T e d ge )- In order to compare 
data sets with different sensitivities, one must account for 
this bias. In this section we describe a method to do this 
by extrapolating the measured properties of a cloud — 

0~maj \Fedge) : ® min^\T edge) i @ v^yT edge) ■> and FQQ(T e dge) 

to those we would expect to measure for a cloud within 
a boundary isosurface of T e d ge = Kelvin (i.e., perfect 
sensitivity). 

We estimate the values of the moments at T e d ge = K 
by extrapolating from higher values of T e d ge - This tech- 
nique was originally suggested f or inferring total cloud ar- 
eas bv lBlitz fc Thadd eus] <|19§ 0j) and appl ied to molecular 
cloud properties bv iScoville et alj (|Tp87) . We calculate 
each of the moments for a sample of boundary tempera- 
tures, T e d ge , ranging from near the peak temperature of 
the cloud to the lowest boundaries allowed by the data. 
Thus, we measure the variations of the moments as a 
function of the boundary temperature, T e d ge , within the 
cloud (this is how we constructed the plot shown in Fig- 
ure |2l . Below, we assume that we have measured each of 
the four moments for values of T e d ge ranging from T m i n 
(the minimum allowed by the data, that is the sensitivity 
limit) to T max (near the peak temperature of the cloud). 

We estimate the value of the moments at T e d ge = K 
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Sensitivity [T m „/T edge ] 

Fig. 4. — The recovered flux as a function of sensitivity, 
Tp e ak/T e dge- The three lines show the fraction of the total flux 
measured for a Gaussian cloud using no sensitivity correction 
(dashed line), a linear extrapolation to perfect sensitivity (gray 
line), and a quadratic extrapolation (black line). The quadratic 
extrapolation does an excellent job of recovering the correct value 
of the flux down to low sensitivities. The uncorrected sum and the 
flux corrected by a linear extrapolation both show a substantial 
sensitivity bias. 



by performing a weighted, linear least-squares fit to 
the measured moments. As an example, we consider 
&maj (T e dge ) ■ The data are modeled as 

(Tedge) = m T e dge + (0 K) (11) 

and the fit determines the extrapolated moment, 
fmaj(0 K). For the fit, each pair of data 
{T edge , (T-maj (Tedge)} is assigned a weight proportional 
to the number of data in the cloud with T > T e d g e, 
so that measurements of the moment using more data 
are weighted more heavily. Practically, this means that 
points to the left in Figure |21 have higher weights than 
those to the right. We use this linear extrapolation for 
Cmaj , v-min , and <j v , but we find that a quadratic extrap- 
olation (including a T^ d term) gives better results for 
the zeroth moment, Fco (though we revert to a linear ex- 
trapolation when the extrapolated flux is lower than the 
measured flux). We plot the flux of a Gaussian cloud as 
a function of T e d g e in Figure^ for an uncorrected zeroth 
moment and the linear and quadratic extrapolations to 
illustrate the appropriateness of the quadratic fit to the 
zeroth moment. At very low sensitivities (signal-to-noise 
ratios near unity), the quadratic extrapolation is very 
noisy, but for signal-to-noise ratios of 2 or better it does 
a dramatically better job of recovering the true flux of 
the cloud (F/Fq = 1) than either the linear extrapolation 
or no extrapolation. 

Figure also shows these extrapolations for a bright 
cloud in M 33. The result of this extrapolation is a set 
of four moments — a maj (0 K), <r mi „(0 K), a v (0 K), 
and ^00(0 K) — that correspond to those we would 
expect to measure given infinite sensitivity. The val- 
ues of these moments should be directly comparable 
even among datasets with different sensitivities (values 
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Note that diffuse emission surrounding a GMC may 
confuse this method. If one data set is measured with 
sensitivity sufficient to detect diffuse emission surround- 
ing a GMC, while another lacks the sensitivity to do 
so then this approach may not be sufficient to correct 
for the sensitivity bias. This problem may be partic- 
ularly acute when comparing Galactic GMCs observed 
with very good sensitivity to extragalactic clouds with 
worse signal-to-noise ratios. Interferometric data "re- 
solves out" emission significantly more extended than 
the synthesized beam (see , repres enting anot h er bia s 
again st dete cting diffuse emiss i on. [ Polk et alJ I l988). 
iBlitzl lll985|) . iRosolowskv et all l)2003|) . and lLerov et ail 
(2005) find evidence for diffuse emission surrounding 
GMCs in the Milky Way and the Local Group Galax- 
ies M 31, M 33, and IC 10, respectively. 

2.3. Correcting for the Resolution Bias 

Any astronomical data set represents the convolution 
of the intensity of the source with the profile of the instru- 
ment used to observe it. Care must therefore be taken 
in measuring sizes and line widths when the extent of 
the intensity distribution is comparable to the instru- 
mental profile. In a typical spectral line data cube two 
profiles are important: the spatial beam and the width 
of a velocity channel. In this section, we describe simple 
corrections to account for the effects of finite spatial and 
spectral resolution. 

We "deconvolve" the spatial beam from the measured 
cloud size by subtracting the RMS beam size, abeam, 
from the extrapolated spatial moments, o~ ma j(T e dge — 
K) and o- m in(T e dge = OK), in quadrature — an ap- 
proach that is exact for Gaussians. The deconvolved sec- 
ond moment is given by: 

*r,*c = yVL, (0 K) - < Qm ]i/2 tfT (0 K) - al a J^ 

(12) 

where a ma j (0 K) and o~ m in (0 K) are extrapolated to the 
Kelvin isosurface as described in H2. 21 This extrapola- 
tion is necessary to make this deconvolution valid: sub- 
tracting the full abeam from the spatial moment measured 
for only part of the cloud will lead to an overcorrection 
and thus to an underestimate of the cloud size. Mea- 
suring the spatial size along the minor axis is also nec- 
essary to ensure that the cloud is indeed resolved in all 
dimensions. This is an advantage of the choice of axes 
(major/minor) described above. With sufficient signal- 
to-noise, this method of deconvolution provides a robust 
measurement of cloud size even for marginally resolved 
clouds. 

Instrumental resolution also affects the measured line 
width. Spectrometers measure the average intensity 
across a channel, rather than sampling the intensity at 
the center (nominal frequency) of that channel. When 
the width of the spectral line under consideration is com- 
parable to the bandwidth of a single channel, the line 
strength varies significantly across an individual chan- 
nel. In this case, the average value may differ substan- 
tially from the value at line center. The output of the 
spectrometer is thus a convolution of the true spectral 
profile with the profile of an individual channel. We ac- 
count for this potential bias towards higher line widths 
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by a simple deconvolution of the channel width from the 
extrapolated second moment: 



a v , dc = ^(OK)--^* (13) 

where o~ v (0 K) is the second moment of the cloud in the 
v dimension extrapolated to Kelvin as described in ^2.21 
and AV c han is the width of a velocity resolution element. 
Although the channel profiles are usually square in shape 
and not Gaussian, we simplify the deconvolution by ap- 
proximating the channel shape as a Gaussian with inte- 
grated area equal to that of a square channel with width 
&V chan . For such a Gaussian, a chan = AV chan /V^- 

2.4. Comparison with Other Methods 

We use extrapolated moments to measure GMC prop- 
erties rather than employing an established method from 
the literature. In this section, we justify our choice by 
comparing several methods of measuring GMC proper- 
ties. We focus on the performance of these methods at 
marginal resolution and low signal-to-noise, conditions 
typical of extragalactic GMC observations. 

Determining the radius of a cloud is particularly diffi- 
cult because GMCs are often asymmetrical with poorly 
defined boundaries. Several authors have devised meth- 
ods to return a single characteristic size for complicated 
emission distributions. The intensity-weighted second 
moments in t he spatial directions h ave been used in many 
studies (e.g. iSolomon et al.lll98^) . but are sensitive to 
both noise and convo lution effects. The other commonly 
used method (e.g. I Williams et al.1 119941: IHever et al.1 
|2001[) is to infer the radius based on the area of the cloud: 

Here A pt is the area of a point source that has 
been convolved with a beam and measured with the 
same signal-to-noise as the em issio n in the map. Fi- 
nally, iWilson fc Scovillei l|1990ft and iTavlor et all l|1999ft 
adopted the size of the cloud as the mean of the decon- 
volved FWHMs of the emission distribution along two 
perpendicular directions. 

We compare these three methods to the extrapolated 
moment method presented above across a range of reso- 
lutions and sensitivities. We measure the size of a Galac- 
tic GMC using each method after convolving it to a de- 
sired resolution and adding noise to produce a particular 
signal-to-noise ratio. For the data, we use the 12 CO dat a 
from the Rosette molecular cloud l)Blitz fc Starld ll986'l. 
which we clip at 2ctrms (the RMS noise in the origi- 
nal data set) and integrate in the velocity dimension to 
produce a map of integrated intensity. For a range of 
sensitivities and resolutions, we convolve this map with 
a Gaussian beam and add noise. We measure the size of 
the cloud in 100 realizations of the noise for each such 
{resolution, sensitivity} pair using (a) the extrapolated 
moment method, (b) the moment of the data without 
extrapolation, (c) the area method and (d) the FWHM 
method. Wherever possible we corrected for the effects 
of beam convolution and signal-to-noise for each of the 
methods. The results of the analysis are shown in Figure 

m 



Figure [5] shows the recovered radius as a function of 
resolution and sensitivity, with the "true" radius defined 
as that measured at very high sensitivity and very good 
resolution (i.e. in the original data, the top right cor- 
ner of each plot). The hashed region of parameter space 
shows the range of parameters over which each algorithm 
recovers a radius within 10% of the true value. The ex- 
trapolated moment method has the largest hashed re- 
gion and so is remarkably robust, recovering the true 
radius over a large range of sensitivities and resolutions. 
Only at low sensitivity (T max /<jRMS < 5) and marginal 
resolution (abeam ^ &o), does the derived radii depart 
systematically from the true radius. Notably, the ex- 
trapolated second moment performs quite well at signal- 
to-noise ratios from 5 to 10 (in the integrated intensity 
map), values typical of extragalactic CO data sets. By 
contrast, the uncorrected moment method (panel b in 
Figure [SJ) underestimates the size at low signal-to- noise 
since (by construction) the uncorrected moment does not 
account for emission below the noise level. Similarly, 
the area method (panel c) shows systematic variation at 
both low signal-to-noise (where emission drops below the 
noise level) and low resolution (where the convolved area 
of the emission distribution grows disproportionately be- 
cause of the filamentary nature of the cloud) . Finally, the 
FWHM method (panel d) shows large systematic varia- 
tions since it depends only on the location of the FWHM 
contour and not on the remainder of the emission distri- 
bution. 

The region of systematic underestimation at low signal- 
to-noise but reasonable resolution shows the effects of 
missing the diffuse emission mentioned in H2.2I The 
Rosette includes more CO emission at low intensities 
than the extrapolated moment predicts from the high 
intensity data. As a result, when that diffuse emission 
is not included in the measurement, the algorithm un- 
derestimates the true radius of the GMC. This effect is 
seen panels (a) and (b) — the extrapolated and uncor- 
rected second moments — and is more pronounced in the 
uncorrected second moment, panel (b). 

We perform a similar experiment on recovering the line 
width of an emission line in noisy data. We measure 
the recovered line width of a Gaussian line of known 
width using three methods (a) the extrapolated moment 
method (b) an uncorrected second moment and (c) a 
Gaussian fit to the line. For a range of signal-to-noise 
levels (T max / 1 cjrms) and channel widths AV c han we mea- 
sure the recovered line width relative to the known line 
width. The mean values of the recovered line for 1000 
realizations of the noise are plotted in Figure H3 The 
extrapolated moment does not show the systematic vari- 
ation with signal-to-noise seen in the uncorrected mo- 
ment. The extrapolated moment is nearly as robust a 
measure as the Gaussian fit for a perfectly Gaussian line 
and will prove superior if the line is not Gaussian. Ro- 
bust recovery of the line width using any method requires 

AV Une / AV chan > 2. 

2.5. Assessing Errors in GMC Properties 

The formal uncertainty associated with each moment 
measurement is quite small. Cloud identification and ex- 
trapolation represent larger sources of uncertainty, but 
their effects arc difficult to assess formally. We use 
bootstrapping methods to estimate the uncertainties in 
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Fig. 5. — The recovered radius as a function of sensitivity (Tmax / cr rm s) an d resolution (o"o/o"(, eam ). The four panels show the radius 
recovered by (a) the extrapolated moment method (this paper), (b) the uncorrected moment method, (c) the area method, and (d) the 
FWHM method. The hashed region indicates the where each algorithm obtains a result within 10% of the original value. The extrapolated 
moment accurately recovers the radius over a large zone of parameter space. Of particular importance for extragalactic measurements, it 
performs better at low signal-to-noise than the other three methods. 



our measurements of cloud properties. The bootstrap- 
ping method determines errors by generating several trial 
clouds from the original cloud data. A trial cloud is gen- 
erated by considering the cloud to be a collection of data 
{xi, yi, Vi, Ti} for i — 1...N, the number of points in 
the cloud. The data are sampled for N random values 
of i, allowing for i to be repeated. The properties of the 
cloud are measured for each trial cloud. We estimate 
the uncertainty from the variance of the cloud properties 
derived from these resampled and remeasured data sets. 
The final uncertainty in each property is the standard 
deviation of the bootstrapped values scaled up by the 
square root of the oversampling rate. The oversampling 
rate, which is usually equal to the number of pixels per 
beam, accounts for the fact that not all of the data in 
each cloud are independent. For many interferometric 
data sets this is an important effect, since these data can 
have 10 or more pixels per beam. 
We compare the uncertainties produced by the boot- 



strapping to those derived from repeatedly adding noise 
to and then reanalyzing a data set. We use the bright 
cloud in M 33 shown in Figure [21 We conduct 100 re- 
alizations of the data plus random noise. The resulting 
uncertainties in a ma j, <J m in, &v> and the flux are 3%, 2%, 
3%, and 3%. Repeatedly bootstrapping the same data 
set (adjusted to have the same final noise level) yields 
average uncertainties of 9%, 9%, 11%, and 5%. The 
bootstrapping estimates are higher for this bright cloud 
because they reflect both the formal uncertainty and the 
robustness of the result to the removal of a given piece of 
data. In the low signal-to-noise regime, the values for the 
two methods converge as noise dominates the uncertainty 
derived from bootstrapping — for example, performing 
the same experiment in a dimmer M 33 cloud with 1/4 
the luminosity of the bright cloud and comparable noise, 
bootstrapping yields errors of 31%, 33%, 32%, and 35% 
in the four moments while repeated realizations produces 
scatters of 15%, 27%, 30%, and 40%. 
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Fig. 6. — The measured line width as a function of sensitivity {T ma x/vRMs) an d spectral resolution (AVpw H M / A-V c h an ) . The three 
panels show the line width of a Gaussian with known properties measured using (a) the extrapolated moment method (this paper), (b) 
an uncorrected second moment and (c) a Gaussian fit to the resulting spectrum. The hashed region indicates the zone of parameter space 
where each algorithm measures a line width within 10% of the actual value. The extrapolated moment provides a good measurement of 
the line width, remaining comparable to a direct fit even in the limit of low signal-to-noise and spectral resolution. Because the Gaussian 
fit is provides poor results for non-Gaussian line profiles, we prefer the extrapolated moment method. 



The bootstrapping method produces a robust, believ- 
able estimate of the uncertainty in the measurement of 
the properties of a particular, defined cloud. It does not 
account for uncertainties in the assignment of emission to 
a cloud either as a result of noise or choice of algorithm. 
These uncertainties are more systematic than random in 
nature and may be best assessed by analyzing the emis- 
sion distribution using several methods. The bootstrap- 
ping estimate may be treated as an accurate estimate of 
the uncertainties in the results given that one adopts the 
methods presented in this paper. 

3. DERIVING PHYSICAL QUANTITIES FROM MOMENT 
MEASUREMENTS 

In this section, we outline how to use the measured 
size, line width, and flux to calculate several physical 
quantities of interest: the effective spherical radius, the 
virial mass, and the luminous mass. Throughout this 
section we assume that clouds can be described as self- 
gravitating spheres with density profiles /> a r 1 and 
negligible support from magnetic fields or confinement 
by external pressure. 

We assume below that the data consists of observa- 
tions of the 12 CO (1 — ► 0) transition, in units of bright- 
ness temperature, but the method is readily adaptable 
to analogous data sets. 

3.1. The Spherical Radius 

We define a factor r\ that relates the one-dimensional 
RMS size, ay, to the radius of a spherical cloud R: R — 
rja r . It is possible to derive an estimate for rj based on 
spherical cloud of radius R with a density profile par - ' 3 . 



In this model, 



Jo J ^ d6 Jq d(p x 2 por ^r 2 sin <j) 



(14) 



Jo d r Jo ^ dd Jq d(j) por~Pr 2 sin < 

R 2 (15) 



3 5-/3 



so that r) ■ 



5-/3 
3^ 



(16) 



For a cloud with /3 = 1, r\ = y/6, somewhat higher than 
the empirical correction of 3.4/ y^r derived by Solomon 
et al. (1987). The difference arises, in part, from using 
12 CO as a density tracer. Since 12 CO emission saturates 
in dense regions and vanishes from low density regions, 
the apparent density profile in 12 CO is shallower than 
the true density profile. Hence, an appropriate value of 
rj likely falls in the range between 3.4/ v / 7r and y/6. For 
tracers like 13 CO with higher critical densities, a differ- 
ent value of T) may be appropriate. Since comparison to 
this "anchoring" data set may be more important than 
adopting a self-consistent — but grossly oversimplified 
— model for a cloud, we recommend the Solomon et al. 
(1987) definition of the cloud radius, R w 1.91oy. Note, 
that adjusting the definition of the radius renders the 
virial mass formula we present below inexact. 

3.2. The Spatial Size, Line Width, Luminosity, and 

Mass 

We convert the cloud properties oy(0 K), a v (0 K), and 
Fco(0 K) to the physical quantities R, AV, and £co- 
For a cloud at a distance of d (in parsecs), the physical 
radius will be 
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R(0 K) arcsec tt 

-R pc = — - x x d pc 

LF J 3600 180 1 J 

the FWHM line width will be 
AV 



V8U2)a v (0 K) , 
and the luminosity of the cloud, Lqo j will be 



(17) 



(18) 



i C o[K km s^ 1 pc 2 ] =F CO (0 K)[K km s^ 1 arcsec 2 ] 
x (d[pc]) 2 



180 • 3600 



(19) 



A particular CO luminosity, L^o, implies a mass of 
molecular gas, AfL um , of 



Mi 



Lum 



2 x 10 20 [cm- 2 /(K km s" 1 )] 
= 4.4 X 2 Leo 



x 4.4 Leo 
(20) 



where Xco is the assumed CO-to-H2 conversion factor. 
This calculation includes a factor of 1.36 (by mass) to 
account for the presence of helium. Including helium is 
important to facilitate comparison with the virial mass, 
which should reflect all of the gravitating mass in the 
cloud. We have adopted a fiducial value of the CO-to-H2 
conversion factor of Xco = 2 x 10 20 cm~ 2 (K km s -1 ) -1 
and express changes relative to this value in terms of the 
parameter X 2 . 

3.3. The Virial Mass 

We compute the virial masses under the assumption 
that each cloud is spherical and virialized with a density 
profile described by a truncated power law of the form 
p oc r~P with no magnetic support or pressure confine- 
ment. As with the spherical radius correction, the exact 
density profile of the cloud will affect the correct form of 
the virial theorem mass. For (3 = 1, the virial mass is 
given by the formula (Solomon et al. 1987): 



M VT = 189 Mq AV 2 R 
and more generally by 



M VT = 125 Mq 



2/3 



3-/3 



AV 2 R 



(21) 



(22) 



where AV is the FWHM velocity line width in km s _1 , 
R is the radius in pc, and the cloud has a density profile 
of p oc r~P . 

Clouds exhibit a range of non-spherical geometries and 
may be supported by magnetic fields or confined by pres- 
sure. Therefore, the studying the virial parameter may 
be more useful than the virial mass itself. The virial 
parameter is a constant of order unity that character- 
izes deviations from the virial theorem applied to a non- 
magnetic cloud wit h no external pres s ure an d constant 
density. Following iMcKee fc Zweibell lfl9921. we define 
the virial parameter as 



5<r 2 R 



{AAX 2 L co )G 



(23) 



Larger-than-unity virial parameters can result from pres- 
sure confinement, while a < 1 may result from significant 
magnetic support. Incorrect values of the CO-to-H 2 con- 
version factor may skew the result in either direction. 
Finding a < 2 means that the clouds are gravitationally 
bound in the absence of significant magnetic support. 

4. MEASURING CLOUD PROPERTIES FROM 
INTERFEROMETER OBSERVATIONS 

Millimeter-wave interferometers arc required to resolve 
even the most massive molecular clouds in galaxies be- 
yond the Magellanic clouds (see Figure |3J|. Unfortu- 
nately, interferometers are not sensitive to spatial fre- 
quencies outside the limited region of the (u, v) plane 
that they sample. Practically, this means interferometers 
do not measure the total flux from the emission distri- 
bution; and structures are resolved out, usually on large 
angular scales that correspond to small separations in 
the (u, v) plane. Ideally, interferometer observations are 
combined with single-dish observations that supply the 
missing information. In practice such observations are 
conducted infrequently and the unknown total flux and 
short-spacing information is estimated using deconvolu- 
tion algorithms such as C LEAN or maximum entropy 
f seelTho mpson e t alJ l2001, and references therein). 

ISheth et al.H2000D simulate the results of using only an 
interferometer to observe Galactic GMCs as if these well- 
studied clouds were located in M31. They find that inter- 
ferometers experience significant (50%) flux loss for their 
simulated observation, primarily from extended emission 
around clouds. However, they find that the flux loss 
does not ch a nge t he size and line width of the cloud. 
IHelfer et alJ |2002) examine the recovery of large-scale 
flux distributions from interferometer measurements in 
more depth and explore the effectiveness of deconvolu- 
tion algorithms at low signal-to-noise. They find that 
deconvolution algorithms recover flux nonlinearly at low 
sensitivities, finding much less flux at low sensitivities 
than one would expect. Since much of the data on extra- 
galactic GMCs have low signal-to-noise, this may repre- 
sent an important bias. 

We assess the effects of interferometric biases on 
th e methods p re sented here by extending the method 
of ISheth et all 1)2000(1 . We use 12 CO observations 
of th ree galactic GMC s: the Orion molecular com- 
plex llWilson et al.ll2005|) . the Rosette Molecular Cloud 
l)Blitz fc1starkTIT9 8|] [). and an excer pt from the Outer 
Galaxy Survey of iHever et alJ l)2001l) which contains the 
molecular clouds associated with the W3/W4/W5 H II 
regions 2 . We simulate observing these three molecular 
complexes in M31 (D = 770 kpc) with the BIMA inter- 
ferometer. 

We Fourier transform each plane of each data set into 
the (w, v) domain and resample the data along the (u, v) 
tracks that would be sampled by BIMA observations of 
the data provided the GMCs were in M31. The (u,v) 
coverage reflects typical observing strategies for extra- 
galactic clouds, such as interleaving observations of the 
source with calibrators and other sources. We add ther- 
mal noise and phase noise to the (u, v) data, including a 
phase noise component with a magnitude that depends 

2 A map of the original Orion data and typical maps at low 
resolution and sensitivity appear in the Appendix. 
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on the length of the baseline. We adjust the scale of 
the thermal and phase noise to produce the desired peak 
signal-to-noise (we find our results depend only weakly 
on whether the noise is thermal or phase) . We invert the 
resul ting (u,v) data u sing the MIRIAD software pack- 
age l)Sault et al.1 119951) producing maps separated by 2 
km s , and then we deconvolve the dirty maps using a 
CLEAN algorithm that terminates at the 2o-rms level. 
For each trial cloud, we then calculate the cloud proper- 
ties using the methods of §321 and [31 

For comparison, we compute cloud properties using 
the same procedure to simulate single-dish observations 
with signal-to-noise and effective resolution identical to 
the mock interferometer data. We generate the mock 
single-dish observations by sampling the transformed im- 
age data for an equal number of (u, v) points as the inter- 
ferometer data, but the points are normally distributed 
in the (u, v) plane and one point is forced to lie at (0, 0) 
thus sampling the total power. The width of the (u, v) 
point distribution is chosen to give a beam size similar 
to that of the mock interferometer data. Random ther- 
mal and phase noise is added to these data in the same 
fashion as for the interferometer data. Then the data are 
inverted using natural weighting and deconvolved in the 
exact same fashion as the interferometer data (though 
the deconvolution step has little effect). Again, we ex- 
tract cloud properties using the methods of §S0and|j|l 

With these simulations, we compare the cloud proper- 
ties derived from interferometer data to single-dish data 
that are equivalent in every other fashion, thereby isolat- 
ing the effects of interferometers on the derived proper- 
ties. The additional biases imposed by limited resolution 
and sensitivity are discussed separately in H2.4I Here we 
focus on mock BIMA observations of the three molecu- 
lar complexes using three antenna configurations: the C 
array (extended), the D (co mpact) array, and a combi- 
nation of C and D array ( see iWrightl l 1996. for details on 
the configurations). The synthesized beam sizes for these 
configurations are 14.4", 6.1" and 8.8" for the D, C and 
C+D hybrid array observations respectively, correspond- 
ing in turn to 54, 22 and 33 parsecs at 770 kpc. We con- 
duct 10 sample observations for each cloud in each array 
at a range of sensitivities ranging from T pea k / <jrms = 3 
to > 100. 

Figure [7| shows the properties recovered by mock ob- 
servations of the Orion molecular complex for each array 
and a range of sensitivities. The values of each properties 
are normalized by the value recovered by mock single- 
dish observations at the same sensitivity and resolution. 
Thus, the only difference between the four sets of prop- 
erties (C, D, C+D, and single dish) is the (it, v) coverage 
of the simulated observations. We find that the derived 
properties from mock single-dish observations follow the 
same behavior as the simulations in H2.4I Thus, it is 
possible to decouple the two sets of biases - those arising 
from marginal resolution and sensitivity and those aris- 
ing from using interferometers - and examine only the 
latter. We plot the results for Orion because these ob- 
servations show the most dramatic variation of the three 
complexes studied, but the results are qualitatively the 
same for all three data sets. Orion is the most sensi- 
tive of the three to spatial filtering because it consists 
of three GMCs and therefore shows more structure than 
the other two targets. 



Based on the results of the mock observations, we make 
the following comments regarding the use of interferom- 
eter data alone in measuring cloud properties. Most of 
these points can be seen visually in Figure 

1. Cloud properties measured from interferometric 
data are biased. The degree of bias is affected by 
the sensitivity of the array as well as the (u, v) cov- 
erage of the observations. 

2. A minimum signal-to-noise of 10 is required for sta- 
ble recovery of cloud properties. Below this level, 
errors in cloud properties, can approach 100% for 
interferometer data. 

3. Even for intermediate signal-to-noise values 
(Tp ea k/ ctrms = 10) there are significant system- 
atic effects on the cloud properties. The most 
extreme effects are on the luminosity measure- 
ment, which can be 40% lower than a single-dish 
observation. This effect is much less pronounced 
for measurements of the line width and the radius, 
which show < 10% variations. This result, that 
the radius and line width are relatively robust to 
the spatial filtering of the int erferometer, c onfirm s 
the qualitative results of iSheth et alJ ((2000) . 
The values of derived properties are always 
underestimated relative to single-dish observations. 

4. Even at high sensitivities, the spatial filtering of 
interferometers affects property recovery at the 
10% level. For example, C-array observations of 
Orion systematically underestimate the radius of 
the cloud by 10% even for very high signal to noise 
ratios and D-array observations underestimate the 
flux of the Orion by 5% even at high sensitivity. 

5. For interferometer observations, the dynamical 
mass measurements of GMCs are more robust than 
the luminosity measurements. This behavior will 
bias interpretations of the virial parameter in ex- 
tragalactic observations. Estimates of the CO-to- 
H2 conversion factor based on the assumption that 
GMCs are bound or virialized are likely to overes- 
timate Xqo- 

6. For a given signal-to-noise value, observations with 
the widest range of (u, v) coverage provide the most 
robust measurement of cloud properties. Thus, in 
achieving a given sensitivity, observers should favor 
arrays with more antennae or observations made in 
multiple configurations. 

5. A NOTE ON DECOMPOSITION 

The choice of how to decompose an emission distribu- 
tion into individual clouds may be the most important 
source of bias in GMC property measurements. Many 
different methods have been applied to identify GMCs 
in blended emission, the most prevalent being decompo- 
sition by eye and the app lication of the GAUSSCLUMPS 
( Stut zki fc Gus^ll990l) or the CLUMPFIND algorithm 
( Willi ams et aUll994j) . When comparing GMCs between 
two data sets, care must be taken to decompose emission 
in a consistent way across both data sets, preferably us- 
ing the same algorithm on both data sets. Furthermore, 
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Fig. 7. — The effects of interferometer observations on the measurement of cloud properties. Each panel shows the variation of cloud 
properties as a function of signal-to-noise for interferometer observations. The value of each property (luminosity, line width, radius) is 
normalized by the value that is measured for single dish observations at the same resolution and sensitivity as that of the interferometer. 
The Figure shows the behavior of cloud properties for simulated BIMA observations of the Orion molecular complex as if the clouds were 
located in M31. No D-array curve is plotted for radius measurements since Orion would be unresolved in M31. The recovery of cloud 
properties is a function of both sensitivity and the (u, v) coverage of the interferometer. 



the physical values of any tuning parameters in the al- 
gorithms should be matched where possible so that both 
algorithms search for peaks in the emission over the same 
spatial scale (rather than angular or resolution-units) or 
intensity range. This will avoid, for example, compar- 
ing "clumps" in a Galactic molecular cloud to GMCs 
in another galaxy. A more extreme method to ensure 
accurate comparison is to convolve the higher (spatial) 
resolution data set to the spa tial resolution of the other 
data (e.g. iSheth et al J 12000^1 . However, this approach 
clearly sacrifices accuracy of the derived parameters to 
allow a more careful comparison between two data sets. 

In the Appendix to this paper we present a robust, con- 
servative, new decomposition algorithm. This algorithm 
is designed to avoid creating spurious clouds from noise 
and to remain sensitive to non-Gaussian structures in the 
data. Additionally, the parameters of the algorithm are 
fixed to physical values rather than being determined by 
the data. This algorithm is designed explicitly with the 
goal of decomposing emission into GMCs (rather than 
clumps or other structures) with extragalactic data in 
mind. The methods for measurement of cloud properties 
described above — including the sensitivity and resolu- 
tion corrections — are independent of the decomposition 
algorithm and are important no matter what decompo- 
sition algorithm is chosen. In order to avoid confusion 
between these two separate problems, we choose to de- 
scribe the decomposition algorithm in the Appendix. 

6. DISCUSSION AND CONCLUSIONS 

We conclude the paper by applying these methods to 
molecular line data sets that have been previously pub- 
lished. In future studies, the algorithm will be used to 
evaluate the differences between GMC populations be- 
tween galaxies. Here, we simply present an analysis de- 
signed to demonstrate the method's utility. We present 
the median corrections found for a large set of extragalac- 
tic (Local Group) observations and a test application of 



our methods to Galactic data. We use all the methods 
discussed in the previous sections and the decomposition 
algorithm discussed in the Appendix. 

6.1. The Effects of Extrapolation and Deconvolution 

We apply the methods outlined here to an array of data 
from across the Local Group and measure the properties 
of 110 spatially resolved clouds to estimate the typical 
magnitude of the sensitivity and resolution corrections 
for extraga lactic data. We use BIMA data on M 3 3 
and M 31 l|Rosolowskv et all 120031: lRosolowskvl | 2005[) : 
NANTEN observations of th e LMC jFukui et al.N l999): 
OVRO observations of IC 1 llWalter 120051): and a SEST 
map of N83 in the SMC l|Bolatto et alJ l2003j) Table 
^ shows the number of clouds measured in each galaxy 
along with the median sensitivity and resolution correc- 
tions applied to the radius, line width, and flux. For 
comparison, we also measure the properties of a num- 
ber of clouds in the outer Gala xy (Quadrant 2, see be- 
low) from the lDame et"all pOOl CO survey of the Milky 
Way. Table n includes all spatially resolved clouds with 
masses (derived from the CO luminosity) of 5 x 10 4 M Q 
or more (92 of the 110 extragalactic clouds are above this 
mass). 

The numbers quoted in Table ^ are "correction fac- 
tors," 



R, 



corrected 



®v, corrected 



^uncorrected &v .uncorrected 



and 



-^CO , corrected 
^CO , uncorrected 



(24) 

Table^shows that throughout the Local Group data the 
corrections suggested in this paper have magnitudes of a 
few tens of percent. We draw several conclusions based 
on these data: 

1. Resolution effects on the size of clouds tend to be 
significant — we would overestimate cloud sizes by 
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~ 40% if we did not apply a deconvolution. In 
the Milky Way data, this effect is much less se- 
vere. The sizes of Milky Way clouds are measured 
to within 5% before the resolution correction. Un- 
resolved clouds do not contribute to Tabled so if 
the effects of the resolution bias were completely 
neglected this would be much larger (a naive ap- 
proach would measure these clouds to have the size 
of the spatial beam). 

2. Resolution effects on the line width are negligible 
throughout the Local Group data. 

3. Sensitivity effects are also significant. Without a 
correction for the sensitivity bias, the size, line 
width, and luminosity of clouds would all be sig- 
nificantly underestimated. This sensitivity bias is 
least severe — only about 20 - 30% — for the line 
width, and most significant (and variable) for the 
luminosity. Sensitivity corrections to the luminos- 
ity vary from 20% to more than 100%. 

4. The resolution and sensitivity biases for the size 
measurement tend to cancel out, so that the com- 
pletely uncorrected radius measurement is often 
within 10 - 20% of the corrected value. This is 
a happy coincidence of resolution and sensitivity 
within the Local Group, not evidence that sensi- 
tivity and resolution corrections are unimportant. 

5. The magnitude of corrections across the Local 
Group data are fairly uniform. This is because 
GMCs near the resolution limit tend to outnumber 
higher mass GMCs. Unresolved GMCs are not in- 
cluded in the analysis, so the median cloud through 
all the data sets appears marginally resolved. 

6. In order to compare extragalactic data to Galactic 
data (with very good sensitivity and resolution and 
therefore small corrections) it is crucial to correct 
for the sensitivity and resolution biases. 



6.2. Analysis of Second Quadrant CO Data 

The method described in this paper has been designed 
with extragalactic data in mind. However, a crucial step 
in interpreting extragalactic measurements is to make 
a fair comparison with Galactic data. In this section 
we report some results of applying our decomposition 
and measurement alg orithms to th e surve y of the second 
Galactic quadrant bv iDame et alJ (J2001) . W e compare 
the re sults of this analysis to the results bv lHever et alJ 
(2001) and show that our analysis recovers results that 
are consistent with theirs. 

We decompose and analyze 12 CO(l — > 0) from the sec - 
ond quadrant (Survey 17 in Table 1 of lDame et all 20011. 
The data set covers the Galactic plane from t = 70° 
to £ = 210° with a noise level of 0.3 K. We measure 
the distance to the molecular emission using the kine- 
matic distances by adopting a flat rotation curve with 
©lsr = 220 km s _1 and Rq — 8.5 kpc. We omit lo- 
cal emission by discarding all elements of the data cube 
with a kinematic distance less than 2 kpc as well as all 
elements in the data cube that are connected by signifi- 
cant emission in position or velocity space to such pixels. 



We apply the decomposition algorithm described in the 
Appendix and measured sizes, line widths, and luminosi- 
ties of GMCs using the methods of [££1 and 3231 The 
analysis recovers 431 clouds with resolved angular sizes 
and line widths located within 10 kpc of the Sun. We 
include the median sensitivity and resolution corrections 
for massive (> 5 x 10 4 M ) clouds in Table □ above. 

Do the results from our algorithm agree with previ- 
ous studies of Ga lactic GMCs ? The data set covers the 
region studied bv lHever etafl l)2001() using the 45" res- 
olution of the FCR AO 14m. That dat a set has a lower 
sensitivity than the IDame et alJ lj2001|) data, so we ap- 
ply a rudimentary sensitivity correction (assuming that 
the GMCs are Gaussian and using their peak tempera- 
ture and boundary isosurface) to their results and scale 
to Xco = 2 x 10 20 cm~ 2 (K km s" 1 ) -1 before comparing 
GMC properties. We focus on a comparison of the virial 
parameter between the two studies — a full treatment 
of the Galactic "Larson's Laws" is beyond the scope of 
this paper. We draw several conclusions from the com- 
parison: 

1. Figure [5] shows that the virial parameters derived 
in our analysis lar gely agree with those found by 
IHever et all p00l|) . Below masses of w 3 x 10 4 
Mq, both surveys find the same virial parameter 
in a given mass bin. 

2. Above ~ 3 x 1 4 Mq, our analysis of the 
IDame et all (|2001|) data set may yield a slightly 
higher virial parameter, on average. This may 
be evidence for the diffuse em ission mentioned in 
%T2l — the higher sensitivity IDame et all {2001) 
data may include diffuse emissi o n surr ounding the 
GMCs while the IHever et all lj2001|) data may 
miss this effect. It may also reflect inadequacies 
in t he simple sensitivi ty correction we apply to 
the IHever et al.l l|200H) data. Resolution effects 
may also play a role — the IHever et alJ ((2001) 
data set has ~ 5 times better resolution than the 
IDame et alJ 1)200 1|) data and so the lower resolution 
data may tend to lump unbound clouds together. 
The number of clouds in the high mass bins is rel- 
atively low, so the discrepancy may not be partic- 
ularly significant. 

3. We apply our algorithm to a small portion of the 
OGS data and find our corrections for resolution 
and sensitivity bias increase the mean virial pa- 
rameter to be cons istent with the results from the 
IDame et all (1200 1|) data. This suggests that the 
differences in the virial parameters for the high 
mass clouds in Figure |S] may simply be method- 
ological. 

4. Both catalogs of outer Galaxy clouds find an in- 
verse relationship between luminous mass and the 
virial parameter, approximately a oc M T ~,° m 2 — a re- 
lation ship that was also observed bv lSolomon et alJ 
dlQSl for inner Galaxy molecular clouds. 

Thus we find agreement with the results of previous 
studies of Galactic molecular clouds. Our methods ap- 
plied to Galactic data find the same behavior observed 
in earlier work and we find agreement among the method 
applied to several data sets. 
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TABLE 1 

Typical Corrections for Local Group Data 
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Fig. 8. — Virial Parameter as a Function of Luminous Mass for 
Clouds in the Perseus Arm. Gray dots indicate the 431 clouds for 
which the virial and luminous mass can both be determined. The 
solid black line is the average of the data in bins of 0.25 dex in 
mass. For comparison, the average virial parameter is plotted as a 
function of mass for the clouds in the OGS catalog (dashed line). 
The two data sets show similar virial parameters at all masses, 
recovering nearly the same virial parameter for a given mass. 



6.3. Conclusions 

We have presented a method for measuring macro- 
scopic GMC properties — spatial size, line width, and 
luminosity — from a region of emission in a spectral line 
data cube. This method corrects for biases from lim- 
ited sensitivity and resolution and produces reliable re- 
sults that are directly comparable among a wide variety 
of data sets. We correct for limited sensitivity via an 
extrapolation to a theoretical Kelvin isosurface. We 
apply a simple quadratic deconvolution to the extrap- 
olated values to account for resolution biases. We find 
that bootstrapping methods yield believable estimates of 
the uncertainties in the derived parameters. We present 
a set of suggestions for transforming the derived proper- 
ties into physical quantities of interest. In the Appendix 
to this paper we present a new method for decomposing 
emission into individual GMCs. This method is con- 
servative and robust, designed to produce robust results 
from low resolution/sensitivity data. In this section we 
have applied all of these methods to an array of extra- 
galactic (Local Group) and Galactic data. We find that 
the algorithm reproduces established results for Galactic 



GMCs and that the sensitivity and resolution biases are 
potentially significant — often ~ 40% — for even the 
most recent Local Group GMC measurements. 

Based on this investigation of the observational biases 
in measuring molecular cloud properties, we note sev- 
eral important points that should be considered in plan- 
ning observations of GMCs and interpreting the results. 
First, resolution and sensitivity biases can be corrected to 
< 10% error provided the cloud has a modest peak signal- 
to-noise (T pea k / o~ rm s ^ 10) and is marginally resolved 
Rcid > 0.8#fwhm, where 6>fwhm is the full width at half 
maximum extent of the beam. Given current and future 
telescope capabilities, even the properties of extragalac- 
tic GMCs can be accurately measured. From Figure |3 
we can see that single dish surveys can accurately study 
clouds more massive than 10 4 M in the Magellanic 
clouds and careful interferometer observations can re- 
cover cloud properties for clouds with mass M > 10 5 M Q 
in M31 and M33. However, interferometer observations 
systematically underestimate molecular cloud properties. 
All else being equal, interferometers can underestimate 
fluxes by 40% and cloud radii by 10% relative to single- 
dish observations. Line widths are largely unaffected by 
interferometer observations. The magnitude of the sys- 
tematic bias depends on the array configuration and the 
sensitivity of the observations. In general, wider coverage 
of the (u, v) plane produces better property recovery. To 
minimize bias, observers should favor observations from 
several array configurations or from arrays with many 
elements. If possible, the interferometer data should be 
supplemented with single-dish observations. Finally, the 
decomposition algorithm used to separate emission into 
physically relevant structures will systematically affect 
molecular cloud properties. To date, there is no algo- 
rithm that should be favored in all circumstances, but 
any comparative study of GMC properties should be con- 
sistent in the choice of algorithm. For example, the re- 
sults of a CLUMPFIND algorithm applied to an extra- 
galactic data set are not directly comparable to a cata- 
log produced by a simple contouring method applied to 
Milky Way data. Provided the same algorithm is used 
across multiple data sets, referencing algorithm parame- 
ters to a common physical scale will minimize systematic 
differences. 

We will make a software version of the decomposition 
and measurement algorithm available as an IDL pack- 
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age. Further, since methodology can affect the results of 
GMC studies so strongly, we encourage authors working 
in the field to make their data available to the commu- 
nity after publication in order to facilitate future rigorous 
comparisons. 
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APPENDIX 

APPENDIX: A NEW DECOMPOSITION ALGORITHM 

The choice of what emission to identify as a GMC may be the single largest source of uncertainty in measuring 
and comparing G MC properties. A number of methods have been employed over the years, from simple contour- 
ing methods (e.g. ISanders et al.l Il985t IDam e et alJl!986t ISolomon et al.lll987t IHev er et alJl2001() to fitting three- 
dimensional Gaussians (GAUSSC LUMPS. IStutzki fc GustedlToQoF TKrarrieret^lllT998j) . to modified watershed algo- 
rithms (CLUMPFIND and its kin, iWilliams et al.lll994HBrunt et al]l2003[h 

In this section we present a new decomposition algorithm that is designed to identify clouds at low sensitivities while 
avoiding the introduction of a false clouds due to noise. This algorithm consists of two parts: identifying regions of 
significant emission in the data set and then assigning this emission to individual "clouds." 



Signal Identification 

We first identify regions of contiguous, significant emission in our position-position-velocity data cube. We estimate 
the noise in the data set by measuring the RMS intensity, ctrms, from a signal- free region of the data cube. We 
then construct a high-significance mask. This mask includes only adjacent channels that both have intensities above 
4o-rms- We expand that mask to include all emission above a lower threshold — typically two channels above 2o~rms 
significance — that is connected to the original high significance mask through pixels with > 2/jrms significance. The 
resulting mask contains most of the significant emission in the data cube. Lowering the threshold below two channels 
at « 1.5tTijMS runs the risk of biasing the moment measurements towards high values by including false emission 
(noise with positive values) in the cloud. 
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Cloud Identification 



In this section, we describe the algorithm used to decompose a region of emission into individual subsections rep- 
resenting the physically distinct entities in the data ( "clouds" ) . Through the description of the algorithm, there are 
several parameters that can be varied to produce changes in the resulting decomposition. In general, we set these 
parameters using physical prior knowledge of the GMCs we are seeking to catalog. We discuss the choice of these 
parameters in the following section. 

1. Discard Small or Low Contrast Regions: If a region is too small for us to measure meaningful properties from 
it, we discard the region. We require that each region has an area larger than two beam sizes, so that we can 
measure its size; and a velocity width of more than a single channel, so that we can measure its line width. If the 
intensity contrast between the peak and the edge of the region is less than a factor of two, we lack the dynamic 
range needed to correct the sensitivity bias and we therefore discard the region. If a region is not discarded we 
proceed to the next step. 

2. Rescale the Data to Reduce the Effects of Substructure: Molecular clouds contain significant substructure that 
confuses the decomposition of these sources. The substructure is often significantly brighter than bulk of the gas 
in the cloud. We rescale the data to reduce the contrast between this substructure and the cloud as a whole. 
The data are rescaled using the following transform: 



This transform reduces the contrast pixels with T > 2T c u p while preserving the relative brightness distribution. 
The value of T c u p is left as a free parameter. The transformed data are used in the decomposition algorithm. 
Such brightness trans forms are frequ ently used in the decomposition algorithms used in other fields such as 
medical imaging (e.g. iLin et al.ll2f)0l . 

3. Identify Independent Local Maxima: We identify potential local maxima, by identifying the elements in the data 
cube that are larger than all their neighbors. We consider neighbors to be all data that lie within a box with 
side length D max in position and AV max in velocity centered on the local maximum. These are our "candidate 
maxima." The parameters D max and AV max are free parameters. If we find more than one candidate maximum 
in a region of emission, we proceed to the next steps to further verify each maximum's independence. If we find 
a single candidate maximum then we label the region as a cloud and measure its properties as described in the 
main paper. 

4. Find Shared Isosurfaces and Reject Small Clouds or those with Smooth Mergers: For each pair of candidate 
maxima we calculate the value of the highest intensity isosurface to contain both maxima. We refer to this 
highest shared isosurface as the merge level. Using this set of highest shared isosurfaces, we calculate three 
properties of interest: 

(a) The area uniquely associated with each maximum (i.e. the area above the merge level for that maximum). 

(b) The antenna temperature interval between the merge level and each maximum, referred to as the contrast 



(c) The fractional amount by which each of the (unextrapolated) moment values changes across each shared 



i. None of the second moments increase by more than 100% for both maxima. 

ii. No two of the second moments increase by more than 50% for both maxima. 

iii. The flux increases by less 200% for both maxima. 

We use these three properties to pare maxima from the region. We reject maxima associated with small areas 
(less than two beam sizes, as for the region above) and contrast intervals less than A T mrtT , a free para meter. 
Choosing AT max > 2a significantly reduces the effects of noise on decomposition (e.g. IBrunt et al.ll2003j) since 
noise is associated with low contrast intervals. Finally, when a pair of maxima merge smoothly across a shared 
isosurface, we keep the higher intensity maximum and discard the lower one. This is a conservative choice in the 
decomposition algorithm: unless merging the two kernels significantly alters the properties of one of the clouds 
associated with the separate kernels, we assume the kernels are not physically distinct. The effect of removing 
kernels that merge smoothly from our data set is to reduce the algorithm's sensitivity to substructure within 
clouds. We iterate this step until we have a set of maxima associated with the required areas and separated from 
each other by significant jumps in their properties. 

5. Define Clouds Using Shared Isosurfaces: The surviving maxima each correspond to a "cloud." That cloud 
consists of the emission within the lowest intensity isosurface uniquely associated with the cloud. Emission that 
lies below this isosurface is part of a "watershed" shared among clouds and we do not assign it to any cloud. By 
not considering contested emission, i.e. emission that could be associated with distinct local maxima, we avoid 
the problem of how to properly assign this emission to local maxima. 




interval. 



isosurface (i.e. ° mai for each maximum across each shared isosurface). We consider the two clouds to 
merge smoothly across the isosurface only if: 
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TABLE A2 
Parameter Choices for Decomposition 
Algorithm 



Parameter 



Data-Based 



CMC Physical Priors 




max 



max 



max 



OO 

1 beam width 
1 channel 
2ci!Jfs 



2 km s 
1 K 



2.5 K 
15 pc 



-i 



6. Measure Cloud Properties: Finally, we apply the methods described in §2 and 3 to derive spatial sizes, line 
widths, and luminosities for each cloud. The transformed data (Step [^Jl are inverse transformed into the original 
brightness units for this analysis. 



Several of the algorithm's parameters are left to the choice of the user. Without any prior knowledge of the physical 
objects in the data, we establish default values for these parameters that will produce a reasonable decomposition 
based solely on the characteristics of the data. These defaults are chosen to provide sensitivity to real substructure 
within the data without contamination by noise and can be regarded as the minimum appropriate values for these 
parameters in most cases. The free parameters in the algorithm are the brightness transform threshold (T c n p , Step[5J) 
position and velocity window size used in searching for local maxima (D max and AV max , Step 03), and the minimum 
contrast temperature (AT max , Step l4b|l . Without prior knowledge of substructure in the data, no brightness transform 
should be applied (T c u p — oo). The minimum values for the parameters in the search for initial local maxima are 
set by the resolution of the data: D max = 0beam and AV max = AV c han or else separations between local maxima 
cannot be resolved. Finally, we use AT rnax = 2o~rms to prevent the noise in the data set from being recognized by 
the algorithm as legitimate structure and becoming the basis for the decomposition. 

In the main section of the paper, we developed methods that measured physical properties of molecular emission 
without observational bias. Ideally, the decomposition algorithm used in the analysis would also be free of observational 
bias. A good algorithm would, for example, decompose the same emission into similar structures regardless of the 
resolution and sensitivity of the observations. To progress towards this goal, we set the parameters of the decomposition 
algorithm to have similar values in physical units (pc, km s _1 , K) rather than data units (beam widths, channels, 
o~rms)- 

In the molecular ISM, which has structure on a range of scales, the choice of the physical values for the algorithm 
parameters must be motivated by prior knowledge of the objects that we wish to identify. The choice of parameters for 
identifying GMCs would be different from the choice of parameters for identifying clumps and cores. As a population, 
GMC s have size scales of 10s of parsecs, line widths of several km s _1 , and brightness temperatures of T < 10 K 
(e.g. iSolomon et al.lll987j) . In contrast, the clumpy substructure within clouds ha s a size scale of 1 pc , line widths 
of order 1 km s _1 and can have brightness temperatures of 30 K or higher (e.g. W illiams et al]ll994j) . We select 
the parameters of the algorithm to find molecular clouds rather than the clumps within them. In particular, we set 
the minimum separation between local maxima as D max — 15 pc and the velocity separation to be AV max — 2 km 
s _1 . We also fix a contrast interval in antenna temperature of AT max = 1 K rather than the data-driven value of 
2a rms- Finally, we must account for the presence of bright molecular gas substructure within molecular clouds. For 
example, Orion A has a few distinct regions separated by > 15 pc with antenna temperatures in excess of 15 K 
(TWilson et alJl200l . These are associated with hot molecular gas around young stars like the Trapezium cluster. In 
general, the typical kinetic temperature of gas in GMCs is ~ 10 K so brightness temperatures in excess of 10 K are 
usually associated with structure within molecular clouds not changes from cloud to cloud. Hence, to catalog clouds 
and not substructure, we must reduce the influence of the high Tt> data with the parameter T c u p . We u se T^ur, = 2.5 K 
to ma intain the full sensitivity for the gas separating GMCs (which typically has Ta < 5 K in the Sol omon et alJ 
(1987) data) while reducing the influence of brighter gas associated with a single GMC. Using T c n p = 2.5 K has little 
influence on extragalactic data where beam deconvolution typically averages out the presence of bright substructure 
within the clouds. 

We summarize our choices of parameters in Table lA"2l when the parameters are motivated by the data (Data-Based) or 
by physical assumptions about the structures being extracted (GMC Physical Priors). The resolution or the sensitivity 
of a data set may be sufficiently poor that the physical parameters are unattainable in the data set (e.g. Obeam > 15 pc, 
2<J rms > 1 K). In this case, the decomposition must be regarded with caution as it may not be directly comparable to 
other data sets. The adopted values are appropriate for decomposing data sets of 12 CO emission. The values of T c u p 
and AT max would be different for other tracers. 

Comparison with Existing Algorithms 

To demonstrate that the proposed algorithm is actually an improvement on existing methods, we analyze trial 
data sets with known properties using this algorithm and compare the results to those from the GAUSSCLUMPS and 



Using Physical Priors to Establish Algorithm Parameters 



Bias-free GMC Properties 



17 



10 I 




0.0 0.5 1.0 1.5 2.0 2.5 3.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 

Separation / Resolution Size / Resolution 

Fig. A9. — The mean number of clouds with Ty^akl^RMS > 5 recovered by three different decomposition algorithms on trial data, 
(left) The three algorithms applied to a blend of two Gaussian clouds. The results of using algorithm from the present study are minimally 
affected by the presence of noise, (right) The three algorithms applied to a circular top-hat cloud showing the effects of non-Gaussian shape 
on the results of the algorithms. The current algorithm is the best for grouping together large structures with non-Gaussian shapes. 



CLUMPFIND algorithms to the trial dat a. We use the CLFIN D algorithm implemented in MIRIAD ijSault et alll995|) 
and our own IDL implementation of the iKramer eTaTI ((199^ GAUSSCLUMPS algorithm, which we have compared 
to the standard algorithm operating on a real data set with satisfactory results. 

We first examine how adept the three algorithms are at decomposing a pair of blended clouds. We construct a series 
of data cubes with two unresolved model clouds separated in position by a variable distance. The model clouds each 
have a peak signal-to-noise ratio of 10 with a Gaussian line profile. For each value of the separation, we decompose 
100 data cubes with different rea lizations of the noise using the three algorithms, using data based choices for the 
algorithm parameters. Figure lA"9l (left') shows the mean number of clouds recovered by each algorithm as a function 
of the separation (we count clouds with peak signal-to-noise larger than 5 as "recovered"). As the trial clouds are 
moved farther apart, the typical number of clouds detected by each algorithm increases by 1. However, only the 
algorithm presented here consistently recovers a single cloud at low separations. GAUSSCLUMPS and CLUMPFIND 
produce false clouds from the noise. The jump in the number of clouds detected by the GAUSSCLUMPS algorithm 
occurs where the separation equals the resolution, whereas both the current algorithm and CLUMPFIND are able to 
distinguish the clouds only if their separation is over 1.5 times the resolution. GAUSSCLUMPS appears to be able to 
distinguish tight blends of clouds but is the most susceptible to noise. 

As a second test of the algorithms, we compared the number of clouds that the algorithms recover from a single 
cloud with a circular top-hat brightness profile (Ta — const, for r < Rq; otherwise) and peak signal-to-noise of 
10. The size of the cloud is varied with respect to the resolution of the data set and 100 data sets for each value of 
Ro are decomposed by each algorithm. The non-Gaussian brightness profile confounds all of the algorithms but to 
varying degrees. CLUMPFIND detects an increasing number of spurious clouds as the cloud grows, suggesting that 
the number of false clouds grows with the volume studied. Despite the non-Gaussian profile, GAUSSCLUMPS does 
surprisingly well with large sources. The current algorithm, however, does the best job of detecting a single source in 
the presence of noise. For analyzing data with a relatively low signal-to-noise, we find the decomposition algorithm 
presented in this Appendix should be favored for identifying clouds. 

Applying the Algorithm to the Orion- Monoceros Region 

The Orion Molecular Cloud is among the best studied of all GMCs, so it is an good place to compare the results of 
our methods to those of previous work. For this comparison , we use the data and results of the recent, uniform survey 
of the entire Orion-Monoceros region bv I Wilson et alJ l)2005|) . We analyze their final data set using the metho ds in t his 
paper (with physical priors for the decomposition algorithm parameters) and summarize the results in Fieure lAlOl an 
integrated intensity map of the region. The decomposition results in 81 molecular clouds, most of which are associated 
with the Galactic plane at the top of the Figure. 

The results of the algorithms decomposition of the region are good: major molecular clouds are identified as single 
entities in the decomposition. The algorithm succeeds where other cloud identification schemes would face difficulty. 
Nearly all of the emission in the data is connected above a single isosurface so that simple contouring methods 
would identify the e ntire Orion-Monoceros region as a single molecular cloud, though more complicated algorithms 
l|Scoville et alJll987|) may succeed. CLUMPFIND and GAUSSCLUMPS would isolate individual peaks and decompose 
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Fig. A10. — Masked, integrated-intensity map of the Orion-Monoceros region with a logarithmic stretch. The ellipses plotted over the 
data indicate the positions, sizes and orientations of molecular clouds identified by our decomposition algorithm. In general, the algorithm 
does a good job of identifying known molecular clouds as single entities. Notable molecular features are labeled. 



TABLE A3 

Properties of Major Molecular Clouds in Orion-Monoceros 



Name 


Distance a 


^human 


-^algorithm 


Re 


(Tv 


a 




(pc) 


(10 4 M o ) 


(1O 4 M ) 


(PC) 


(km s- 1 ) 




Orion A 


480 


12. 


11. ±0.06 


18.6 ±0.2 


2.9 ±0.1 


1.6 ±0.1 


Orion B 


500 


9.1 


5.6 ±0.08 


12.1 ±0.3 


1.5 ±0.1 


0.6 ±0.1 


Orion East 


120 


0.013 


0.013 ±0.001 


1.2 ±0.1 






Mon R2 


800 


12. 


11. ± 0.5 


25.9 ± 1.2 


1.6 ±0.1 


0.7 ±0.1 


Crossbones 


470 


1.9 


0.57 ± 0.02 


11.0 ±0.6 


0.8 ±0.1 


1.5 ±0.3 


Northern Filament 


390 


1.9 


1.2 ±0.07 


8.6 ± 0.7 


1.7 ±0.1 


2.6 ±0.5 



a Adapted from Table 2 of lWilson eTai] [2005). 



clouds into their substructure — as was intended by their design (CLUMPFIND identifies 617 clumps in the same 
data). 

The properties of the clouds agree well with the values published for a human decompos ition of the emission . The 
results of the analysis are given in Table IA"31 For comparison, we adopt the distances of lWilson et alJ ((2.005). We 
compare the results of the algorithm to their results, after scaling their mass up by 10% to account for a difference 
in the adopted CO-to-H2 conversion factors. In several cases the masses agree quite well (to within 5%, Orion A, 
Mon R2, Scissors), while other features show ~ 30% differences. These systematic discrepancies arise from differences 
in how emission is assigned into structures. For example, the algorithm only identifies the central region of Orion 
B as a molecular cloud; it does not include emission near the location of Orion East that is nominally part of the 
Orion B cloud because the assignment of this emission is contested with neighboring clouds. Similarly, the algorithm 
characterizes the Northern Filament region as five distinct clouds rather than the single large cloud that a human 
decomposition produced. Despite these differences, the results of the algorithm are reassuring - in most cases, the 
well-known molecular clouds are identified as single clouds and there is good agreement between their derived physical 
properties and the results of previous studies. 
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Fig. All. — The Orion-Monoccros Region convolved to the typical signal-to-noise (S/N) and spatial resolution (beam size) of extragalactic 
observations. The integrated intensity map of the cloud is drawn as a contour plot with the positions and sizes of clouds that the algorithm 
identifies drawn as gray ellipses. Contours on the integrated intensity map are drawn at -0.1 (dashed), (bold), 0.1, 0.2, ... 1.0 times the 
peak value in the map. The sizes of the clouds are not deconvolved from the beam, so coarser spatial resolution results in objects that are 
apparently larger. Except in the marginally resolved, low S/N case, the algorithm successfully identifies Orion A and B. Mon R2 and the 
Northern Filament appear in maps with higher resolution and signal-to-noise. 



Decomposing Orion-like Clouds in Other Galaxies 

As a final test of the decomposition and analysis algorithm, we apply the algorithm to simulated observations of the 
Orion-Monoceros region with beam sizes and signal-to-noise values typical of extragalactic GMC observations. We 
simulate observations by convolving the data set to the desired resolution and adding a convolved data cube of noise 
which is scaled up to give the desired peak signal-to-noise ratio in the data. We perform this for a peak signal-to-noise 
(S/N) values of 10 and 30 combined with beam sizes of 10, 20 and 50 pc. The results of the decompositions are 
displayed in Figure lAl II 

At coarse resolution and low sensitivity, the fainter clouds are undetectable. However, the algorithm identifies each 
of Orion A, Orion B, Mon R2 and the Northern Filament in at least one of the trial data sets. At high resolution 
(10 pc) and peak signal-to-noise (S/N = 30), the algorithm successfully identifies the clouds with propertie s tha t are 
consistent, within the uncertainties determined by the algorithm, with the masses of the clouds in Table IA3I The 
systematic effects of poor resolution manifest themselves for beam sizes > 20 pc where the decomposition of blended 
emission results in ~ 20% variations in the properties of the clouds relative to those found in the original data set for 
both high and low S/N. For a beam size of 20 pc, the clouds are not resolved since this is over twice the size of the 
clouds along their minor axes. Finally, at 50 pc resolution, Mon R2 is not found by the algorithm and high S/N is 
required to distinguish Orion A and Orion B. There is a variation of 100% (0.3 dex) in the derived physical parameters 
for the clouds; this variation is reflected in the estimates of the uncertainties. As expected based on the 20 pc resolution 
data, the clouds are not spatially resolved for a beam size of 50 pc. An accurate decomposition of emission into typical 
Galactic GMCs requires a beam size ~ 20 pc though only a modest sensitivity is required: S/N > 10. 



